Artex is AnotheR TEXt summarizer

نویسنده

  • Juan-Manuel Torres-Moreno
چکیده

This paper describes Artex, another algorithm for Automatic Text Summarization. In order to rank sentences, a simple inner product is calculated between each sentence, a document vector (text topic) and a lexical vector (vocabulary used by a sentence). Summaries are then generated by assembling the highest ranked sentences. No ruled-based linguistic post-processing is necessary in order to obtain summaries. Tests over several datasets (coming from Document Understanding Conferences (DUC), Text Analysis Conference (TAC), evaluation campaigns, etc.) in French, English and Spanish have shown that Artex summarizer achieves interesting results.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Three Statistical Summarizers at CLEF-INEX 2013 Tweet Contextualization Track

According to the organizers, the objective of the 2014 CLEFINEX Tweet Contextualization Task is: “...The Tweet Contextualization aims at providing automatically information a summary that explains the tweet. This requires combining multiple types of processing from information retrieval to multi-document summarization including entity linking.” We present three statistical summarizer systems ap...

متن کامل

Development of a Swedish Corpus for Evaluating Summarizers and other IR-tools

We are presenting the construction of a Swedish corpus aimed at research on Information Retrieval, Information Extraction, Named Entity Recognition and Multi Text Summarization, we will also present the results on evaluating our Swedish text summarizer SweSum with this corpus. The corpus has been constructed by using Internet agents downloading Swedish newspaper text from various sources. A sma...

متن کامل

Query-Based Summarizer Based on Similarity of Sentences and Word Frequency

Text summarization is the most challenging task in information retrieval tasks. It is an outcome of electronic document explosion and can be seen as the condensation of the document collection. The use of text summarization allows a user to get a sense of the content of full-text, or to know its information content without reading all sentences within the full-text. Data reduction helps user to...

متن کامل

An Efficient Statistical Approach for Automatic Organic Chemistry Summarization

In this paper, we propose an efficient strategy for summarizing scientific documents in Organic Chemistry that concentrates on numerical treatments. We present its implementation named yachs (Yet Another Chemistry Summarizer) that combines a specific document preprocessing with a sentence scoring method relying on the statistical properties of documents. We show that yachs achieves the best res...

متن کامل

Text Summarization by Sentence Segment Extraction Using Machine Learning Algorithms

We present an approach to the design of an automatic text summarizer that generates a summary by extracting sentence segments. First, sentences are broken into segments by special cue markers. Each segment is represented by a set of predeened features (e.g. location of the segment, number of title words in the segment). Then supervised learning algorithms are used to train the summarizer to ext...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1210.3312  شماره 

صفحات  -

تاریخ انتشار 2012